International  Journal  of  Agricultural 
Science  and  Research  (IJASR) 

ISSN(P):  2250-0057;  ISSN(E):  2321-0087 
Vol.  8,  Issue  1,  Feb  2018, 129-136 
©  TJPRC  Pvt.  Ltd. 

ARIMA  FORECAST  OF  AREA,  PRODUCTION  AND  PRODUCTIVITY 
OF  RICE  IN  AMALGAMATED  RAIPUR  USING  R 
G.  BHAVANI1  &  A.  K.  SINGH2 

1 Student  M.  Sc.  (Agri),  Department  of  Agricultural  Statistics,  IGKV,  Raipur,  Chhattisgarh,  India 
2  Professor  and  Head,  Department  of  Agricultural  Statistics,  IGKV,  Raipur,  Chhattisgarh,  India 

ABSTRACT 

The  forecast  of  the  production  of  rice  over  future  time  series  in  Chhattisgarh  has  always  been  a  matter  of 
concern  to  the  planners  of  the  State,  especially  being  a  predominantly  tribal  region.  The  present  study  aims  at  patterns 
of  studying  area,  production  and  productivity  of  rice  crop  for  Amalgamated  Raipur  districts  of  Chhattisgarh  Plains  over 
a  time  series  from  1980  to  2015  and  their  forecasts  for  the  next  five  years  using  ARIMA  and  double  exponential 
methods.  The  forecasts  undertaken  were  based  on  the  libraries  developed  in  the  globally  famous  statistical  software 
environment  “R:  The  R  Project  for  Statistical  Computing ”  available  at  www.r-vroiect.ors.  Some  good  models  have  been 
developed  for  the  area,  production  and  productivity  wherever  model  diagnostics  permitted.  A  synopsis  and  evaluation  of 
Good  fitted  forecast  models  have  been  summarized.  Subsequently,  forecast  based  on  these  good  models  have  been 
attempted  from  2015-16  to  2019-20. 
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INTRODUCTION 

Chhattisgarh  State  was  created  on  November  1,  2000  by  incorporating  16  districts  of  undivided  eastern 
Madhya  Pradesh.  It  shares  its  borders  with  six  states  of  the  country  -  Orissa  in  the  east,  Jharkhand  in  the  north-east, 
Madhya  Pradesh  in  the  west  and  north-west,  Uttar  Pradesh  in  the  north,  Maharashtra  in  the  south-west  and  Andhra 
Pradesh  in  the  south  and  south-east.  With  a  geographical  area  of  137.90  lakh  ha  (www.agridept.cg.gov.in). 
Chhattisgarh  is  the  tenth  largest  State  (https://en.wikipedia.org)  in  India,  contributing  about  2.97  per  cent  of  the 
total  annual  food  grains  production  in  India  (DASC,  2012-13).  The  State  has  a  net  sown  area  47.75  lakh  ha, 
cropping  intensity  138  percent  and  forest  coverage  of  63.53  lakh  ha.  After  creation  of  Chhattisgarh,  many  districts, 
and  many  erstwhile  districts  were  divided,  among  which  Raipur  was  divided  twice  in  1998-99  and  201 1-12.  Due  to 
these  divisions  the  divided  parts  of  the  corresponding  districts  suffered  due  to  data  availability  in  order  to 
determine  the  patterns  of  area,  production  and  productivity  of  different  crops.  So,  to  get  a  clear  picture,  the  areas 
and  productions  of  the  divided  parts  were  amalgamated,  and  productivity  averaged,  to  form  Amalgamated  Raipur, 
as  if  they  were  not  divided.  Thus,  the  entire  study  and  forecast  were  made  for  this  Amalgamated  Raipur. 

METHODS 

The  present  study  utilizes  a  relational  database  created  in  the  Department  of  Agricultural  Statistics  and 
Social  Science,  Indira  Gandhi  Krishi  Vishwavidyalaya,  Raipur,  Chhattisgarh,  on  the  basis  of  data  acquired  from 
www.agridept.cg.gov.in  and  Anonymous  (1981-1998).  The  detailed  methodologies  adopted  in  the  present 
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investigation  are  presented  in  the  following  sub-heads. 

ARIMA  MODEL 

In  time  series  analyses,  an  autoregressive  integrated  moving  average  (ARIMA)  model  is  a  generalization  of 
an  autoregressive  moving  average  (ARMA)  model.  Anonymous  (2016).  These  models  are  fitted  to  time  series  data  either 
to  better  understand  the  data  or  to  predict  future  points  in  the  time  series  (forecasting).  They  are  applied  in  some  cases 
where  data  show  evidence  of  non-stationary  and  where  an  initial  differencing  step  (corresponding  to  the  "integrated"  part 
of  the  model)  can  be  applied  to  reduce  the  non-stationarity.  Thus,  the  autoregressive  (AR)  component  of  the  ARIMA 
model  is  a  representation  of  a  type  of  random  process  representing  certain  time-varying  processes,  economical 
phenomenon  of  the  nature,  etc.  It  specifies  that  the  output  variable  depending  linearly  on  its  own  previous  values  and  on  a 
stochastic  term.  This  AR(p),  an  autoregressive  model  of  order  p,  can  be  defined  as, 

n=R+2f=1  (i) 

where  <#T, . 'fp  are  the  parameters  of  the  model,  /<  is  a  constant,  and  £~t  is  stochastic  white  noise. 

The  second  component  of  ARIMA  is  Moving-Average  (MA)  model.  The  general  MA  process  of  order  q  can  be 
defined  as, 

Yt  =  ft  +  H - 1-  Oq£t -q-  (2) 

where  p  is  the  mean  of  series,  the  ... ,  are  the  parameters  of  the  model  and  the  ft-v  are  the 

stochastic  noise  error  terms.  The  value  of  q  is  called  the  order  of  the  MA  model. 

Now,  combining  the  AR  process  with  the  MA  process  and  integrating,  with  differencing  step,  to  remove  non- 
stationarity  from  the  non-seasonal  time  series,  the  ARIMA  is  generally  denoted  by  ARIMA  (p,d,q),  where  parameters  p,  d, 
and  q  are  non-negative  integers,  p  being  the  order  of  the  autoregressive  model,  d  being  the  degree  of  differencing, 
and  q  being  the  order  of  the  moving-average  model.  In  other  words,  ARIMA  (p,d,q),  is  an  autoregressive  integrated 
moving  average  time  series. 

Thus,  the  ARIMA  (p,d,q)  model,  (Qbal,  N.  et  al.  2005)  can  be  represented  by  the  following  general  forecasting 

equation: 

=p+f'f_i  +t_.  +  (3) 

Or, 

=V+Yt-i  +t_.  (4) 

DOUBLE  EXPONENTIAL  SMOOTHING  MODEL 

The  double  exponential  smoothing  is  best  applied  to  time  series  that  exhibit  prevalent  additive  (non-exponential) 
trend,  but  do  not  exhibit  seasonality.  (Prajakta  S.  (2004)).  The  recursive  form  of  the  Holt’s  double  exponential  smoothing 
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equation  is  expressed  as  follows: 

Ft  (m)  =  5f  +  m  x  bt 
Sf  =aYf+  (l-a)(Sf_i  +  &t- 1) 
h  —  —St-i)  + 


Where, 

is  the  value  of  the  time  series  at  time  t; 

is  a  smoothed  estimate  of  the  value  of  the  time  series  Y  at  the  end  of  period  t; 
is  a  smoothed  estimate  of  average  growth  rate  at  the  end  of  period  t. 
a  is  the  level  smoothing  coefficient. 

P  is  the  trend  smoothing  coefficient. 

is  the  m-step-ahead  forecast  values  for  Y  from  time  t. 

The  estimates  and  forecasts  for  above  mentioned  models  have  been  computed  based  on  the  libraries  developed  in 
the  globally  famous  statistical  software  environment  “R:  The  R  Project  for  Statistical  Computing”  available  at  www.r- 
project.org.  The  authors  have  used  the  methods  relevant  to  these  as  illustrated  in  Coghlan,  A.  (2014,  2017). 

RESULTS  AND  DISCUSSIONS 

Based  on  the  methods  elucidated  above,  good  models  have  been  estimated  using  double  exponential  and  ARIMA 
models  as  listed  in  following  Table  1.1,  for  which  predictions  were  made: 

Table  1.1:  Synopsis  and  Evaluation  of  fitted  models  based  on  (i)  Model- 1:  Double  exponential  model  with 
parameters  a,  P,  a,  b  and  (ii)  Model-2:  ARIMA  model  with  parameters  ACF,  P-value,  AIC,  BIC,  o2;  and  respective 
diagnostic  plots  with  confidence  intervals. 


Crop 

Variables 

Region/District 

Model  Adequacy 

Rice 

Area 

Amalgamated 

Raipur 

Model- l:Good  fit 

Model-2:Good  fit 

Production 

Amalgamated 

Raipur 

Model- l:Good  fit 

Model-2:Good  fit 

Productivity 

Amalgamated 

Raipur 

Model- l:Good  fit 

Model-2:Good  fit 

So,  using  the  good  models  as  indicated  in  Table  1.1  above,  the  estimated  parameters  of  forecasting  models  for 
area,  production  and  productivity  under  the  rice  crop  for  Amalgamated-Raipur  for  the  period  1980-81  to  2014-15  have 
been  presented  along  with  the  relevant  forecasts  and  their  confidence  intervals  in  Figure  .1  to  Figure  .6  in  the  Appendix-A, 
respectively,  for  double  exponential  smoothing  forecasting  as  well  as  for  ARIMA  forecast.  These  are  followed  by 
respective  graphs  for  plots  of  model  diagnostics  of  (a)  observed  vs.  fitted  and  (b)  double  exponential  smoothing  forecast 
and  (c)  Time  Series  plot  and  (d)  ARIMA  Forecasting  etc. 

The  figures  Figure  .1  from  Figures. 6  in  the  Appendix-A,  can  be  perused  for  various  estimates  of  parameters  for 
levels,  trends  and  seasons  of  area,  production  and  productivity  of  Amalgamated-Raipur  under  rice  crop.  Thus,  from 
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Figures..  1  in  the  Appendix- A,  left  side  table,  for  area  of  rice  it  is  found  that  the  a  coefficient  of  double  exponential  model 
is  very  high  (1,  equal  to  maximum  1),  therefore  the  level  of  this  model  can  be  said  to  depend  more  on  the  recent 
observations  than  the  past  observations  during  the  period  of  study  from  1980-81  to  2014-15,  with  the  start  of  the  level  at 
772.3,  whereas  the  value  of  (3-coefficient  is  very  low  (0.05,  close  to  zero)  indicating  that  there  is  almost  no  trend  in  the 
double  exponential  model  estimate,  with  the  start  of  the  trend  at  a  low  value  of  3.96,  for  the  period  1980-81  to  2014-15. 
Secondly,  from  Figures.2,  left  side  graph,  it  is  clear  that  the  autocorrelation  (i.e.  ACF)  is  zero  because  p-value  >  0.80,  from 
Figure. 1.1,  left  side  table,  the  AIC/BIC  values  are  close  to  371-373  with  variance  of  2254,  i.e.,  standard  deviation  of  47.47 
thousand  ha..  The  forecast  of  the  area  under  rice  by  double  exponential  model  and  ARIMA  model  are  depicted  in  Figures.  1 
and  Figures.2  in  the  Appendix-A,  respectively,  for  80%  and  95%  confidence  intervals  for  five  years  from  2015-16  to  2019- 
2020.  On  perusing  these  graphs  it  may  be  concluded  that  forecasted  area  for  the  double  exponential  model  are  in  the  range 
of  (776.3069  to  792.1785)  000’ha  with  approximate  80%  confidence  interval  (CI80%)  of  (630-930)  000’ha  and 
approximate  95%  confidence  interval  (095%)  of  (550-1050)  000’ha.  On  the  other  hand,  the  forecasted  area  for  ARIMA 
model  is  772.339  000’ha  for  all  the  five  years  from  2015-16  to  2019-2020  with  080%  values  being  in  the  range  630-900 
000’ha  and  095%  range  550-1000  OOO’ha.  Clearly,  the  ARIMA  estimates  give  lower  confidence  intervals  with  pointed 
better  pointed  forecast. 

From  Figures. 3,  left  side  table,  for  production  of  rice  it  is  found  that  the  a  coefficient  of  double  exponential  model 
is  low  (0.48,  equal  to  maximum  1),  therefore  the  level  of  this  model  can  be  said  to  depend  more  on  the  both  recent 
observations,  with  the  start  of  the  level  at  1545.4,  whereas  the  value  of  (3-coefficient  is  very  high  (0.975)  indicating  that 
there  is  trend  in  the  double  exponential  model  estimate,  with  the  start  of  the  trend  at  a  value  of  179.9,  for  the  period  1980- 
81  to  2014-15.  Secondly,  from  Figures.  A. 4  in  the  Appendix-A,  left  side  graph,  it  is  clear  that  the  autocorrelation  (i.e. 
ACF)  is  zero  because  p-value  >  0.80,  from  Figures. 1.3,  left  side  table,  the  AIC/BIC  values  are  close  to  472.71-  491.9  with 
variance  of  33149,  i.e.,  standard  deviation  of  182.06  000’  tonnes.  The  forecast  of  production  under  rice  by  double 
exponential  model  and  ARIMA  model  are  depicted  in  Figures. 3  and  Figures.4  in  the  Appendix-A,  respectively  for  80% 
and  95%  confidence  intervals  for  the  five  years  from  2015-16  to  2019-2020.  On  perusing  these  graphs  it  may  be  concluded 
that  forecasted  production  for  the  double  exponential  model  are  in  the  range  (1725.416  to  2445.13)  000’tonnes  with 
approximate  80%  confidence  interval  (CI80%)  of  (1200-3600)  000’tonnes  and  approximate  95%  confidence  interval 
(095%)  of  (600-4200)  000’tonnes.  On  the  other  hand,  the  forecasted  production  for  ARIMA  model  are  in  the  range 
1454.513-  1531.511  000’tonnes  for  all  the  five  years  from  2015-16  to  2019-2020  with  080%  values  being  in  the  range 
1000-2000  000’tonnes  and  095%  range  700-2300  000’tonnes.  Evidently,  the  ARIMA  estimates  give  better  pointed 
forecast  with  lower  confidence  intervals. 

Next,  from  Figures. 5  in  the  Appendix-A,  left  side  table,  for  productivity  of  rice,  the  a  coefficient  of  double 
exponential  model  is  low  (0.51,  equal  to  maximum  1),  so  the  level  of  the  model  depends  more  on  the  recent  observations, 
starting  at  1545.4,  the  P-coefficient  being  very  low  (0.43)  indicating  no  trend  in  the  double  exponential  model  estimate, 
with  the  intercept  of  1956.7,  for  the  period  1980-81  to  2014-15.  Secondly,  from  Figures. 6  in  the  Appendix-A,  left  side 
graph,  the  autocorrelation  (i.e.  ACF)  is  zero  because  p-value  >  0.80,  from  Figures. 5,  left  side  table,  the  AIC/BIC  being 
close  to  490.08  -  493.19  with  variance  of  63355,  i.e.,  standard  deviation  of  251.70  kg.  The  forecast  of  productivity  under 
rice  by  double  exponential  model  and  ARIMA  model  are  depicted  in  Figures. 5  and  Figures. 6  in  the  Appendix-A, 
respectively  for  80%  and  95%  confidence  intervals  for  the  five  years  from  2015-16  to  2019-2020.  On  perusing  these 
graphs  it  is  concluded  that  productivity  for  the  double  exponential  model  are  forecasted  in  the  range  of  2036.08  to  2353.25 
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kg  with  approximate  080%  of  1200-3400  kg  and  095%  of  700-3900  kg.  On  the  other  hand,  the  forecasted  productivity 
for  ARIMA  model  are  in  the  range  1858.49  -  1934.559  kg  for  all  the  five  years  from  2015-16  to  2019-2020  with  080% 
values  being  in  the  range  1400-2400  kg  and  095%  range  1100-2700  kg.  On  comparison,  the  ARIMA  estimates  gives 
lower  confidence  intervals  with  better  pointed  forecast. 

Finally,  based  on  Figures  .1-.6,  a  clear  picture  emerges  in  terms  of  a  great  diversification  of  the  area  under  rice  to 
other  crops  from  the  year  1998-99  onwards  in  Amalgamated  Raipur,  which  by  the  way  coincides  with  the  first  split  of 
Raipur  district  in  the  same  year  1998-99  too.  Due  to  these  factors  the  area  and  production  of  rice  take  a  sudden  dip  in  1998- 
99  and  then  gradually  rise  onwards.  However,  the  productivity  maintained  a  rising  trend  after  a  few  years  of  stationary 
following  diversification  of  rice  area  to  other  crops  in  Raipur  district  and  the  first  split  of  Raipur  district,  in  1998-99, 
possibly  as  a  result  of  policy  paralysis  emerging  out  of  shock  due  to  above-said  diversification  along  with  a  split  in  the 
district.  Despite  these,  the  production  of  rice  in  Amalgamated  Raipur  in  2014-15  has  now  crossed  the  peak  of  rice 
production  compared  to  what  it  was  just  before  the  start  of  diversification,  in  1998-99,  of  rice  area  to  other  crops  in  Raipur 
district. 

CONCLUSIONS 

It  may  be  concluded  from  the  present  study  that  ARIMA  models  fitted  better  than  the  double  exponential  models 
uniformly,  as  depicted  from  Figures  .1-.6  in  the  Appendix-A,  for  area,  production  and  productivity  of  rice  under 
Amalgamated  Raipur,  in  terms  of  all  the  standard  diagnostics’  estimates  like  AIC,  BIC,  p- values,  confidence  intervals 
(CIs),  autocorrelation  functions  (ACFs)  on  the  one  hand;  and  graphical  diagnostics  on  the  other  hand,  like  observed  vs 
fitted  graphs  and  ARIMA  ACF  graphs.  It  is  further  concluded  that  the  area  and  production  of  rice  take  a  sudden  dip  in 
1998-99  indicating  a  great  deal  of  diversification  of  rice  area  to  other  crops.  However,  both  parameters  gradually  rise 
onwards.  Yet,  the  productivity  maintained  a  rising  trend,  after  a  few  years  of  stationary  following  diversification  in  the  rice 
area  to  other  crops  in  Raipur  district  and  the  first  split  of  Raipur  district,  in  1998-99,  possibly  as  a  result  of  policy  paralysis 
emerging  out  of  shock  due  to  above-said  diversification  along  with  a  split  in  the  district.  Despite  these,  the  production  of 
rice  in  Amalgamated  Raipur  in  2014-15  has  now  crossed  the  peak  of  rice  production  compared  to  what  it  was  just  before 
the  start  of  diversification,  in  1998-99,  of  rice  area  to  other  crops  in  Raipur  district. 
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APPENDIX-A 


Figure  1:  Estimated  Forecast  Models  of  Area  of  Amalgamated  Raipur 
Under  Rice  for  1980-81  to  2014-15:  (a)  Observed  vs.  Fitted  and 
(b)  Double  Exponential  Smoothing  Forecast 
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Forecasting  of  area  for  Amalgamated  Raipur  under  Rice  for 
YEARS  FROM  2015-16  TO  2019-20 
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Figure  2:  Estimated  Forecast  Models  of  Area  of  Amalgamated  Raipur  under 
Rice  for  1980-81  to  2014-15:  (c)  Time  Series  and  (d)  ARIMA  Forecasting 
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Figure  3:  Estimated  Forecast  Models  of  Production  of  Amalgamated 
Raipur  under  Rice  for  1980-81  to  2014-15:  (a)  Observed  vs.  Fitted  and 
(b)  Double  Exponential  Smoothing  Forecast 


Figure  4:  Estimated  Forecast  Models  of  Production  of  Amalgamated  Raipur 
Under  Rice  for  1980-81  to  2014-15:  (c)  Time  Series  and  (d)  ARIMA  Forecasting 
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Double  Exponential  Smoothing  FORECAST  OF  PRODUCTIVITY  FOR  AMALGAMATED 
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Figure  5:  Estimated  Forecast  Models  of  Productivity  of  Amalgamated  Raipur 
Under  Rice  for  1980-81  to  2014-15:  (a)  Observed  vs.  Fitted  and 
(b)  Double  Exponential  Smoothing  Forecast 
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ARIMA  Diagnostic  Graph 


ARIMA  FORECAST  OF  PRODUCTIVITY  FOR  AMALGAMATED  RAIPUR  UNDER 


Rice  for  next  5  years  from  2015-16  to  2019-20 
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Figure  6:  Estimated  Forecast  Models  of  Productivity  of  Amalgamated  Raipur  under 

Rice  for  1980-81  to  2014 
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